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The 3C-like proteinase of severe acute respiratory syn- 
drome (SARS) coronavirus has been proposed to be a key 
target for structural-based drug design against SARS. In 
order to understand the active form and the substrate 
specificity of the enzyme, we have cloned, expressed, and 
purified SARS 3C-like proteinase. Analytic gel filtration 
shows a mixture of monomer and dimer at a protein con- 
centration of 4 mg/ml and mostly monomer at 0.2 mg/ml, 
which correspond to the concentration used in the en- 
zyme assays. The linear decrease of the enzymatic-spe- 
cific activity with the decrease of enzyme concentration 
revealed that only the dimeric form is active and the 
dimeric interface could be targeted for structural-based 
drug design against SARS 3C-like proteinase. By using a 
high pressure liquid chromatography assay, SARS 3C-like 
proteinase was shown to cut the 11 peptides covering all 
of the 11 cleavage sites on the viral polyprotein with dif- 
ferent efficiency. The two peptides corresponding to the 
two self-cleavage sites are the two with highest cleavage 
efficiency, whereas peptides with non-canonical residues 
at P2 or P1’ positions react slower. The P2 position of the 
substrates seems to favor large hydrophobic residues. 
Secondary structure studies for the peptide substrates 
revealed that substrates with more B-sheetlike structure 
tend to react fast. This study provides a basic understand- 
ing of the enzyme catalysis and a full substrate specificity 
spectrum for SARS 3C-like proteinase, which are helpful 
for structural-based inhibitor design against SARS and 
other coronavirus. 


The outbreak of a severe atypical pneumonia in early 2003 
has caused 8422 cases and 916 related deaths. The World 
Health Organization has designated the illness as severe acute 
respiratory syndrome (SARS).! A novel form of coronavirus has 
been identified as the major cause of SARS (1, 2). The genome 
of SARS coronavirus has been sequenced within a short period 
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of time after confirmation of the virus (3, 4). Currently, 23 
genome sequences of different variations of SARS coronavirus 
have been released at the NCBI web site (www.ncbi.nlm.nih. 
gov/). Coronaviruses are members of positive-stranded RNA 
viruses featuring the largest viral RNA genomes up to date. 
The SARS coronavirus replicase gene encompasses two over- 
lapping translation products, polyproteins 1a (~450 kDa) and 
lab (~750 kDa), which are conserved both in length and amino 
acid sequence to other coronavirus replicase proteins. Polypro- 
teins la and 1ab are cleaved by the internally encoded 3C-like 
proteinase to release functional proteins necessary for virus 
replication. The SARS 3C-like proteinase is fully conserved 
among all of the released SARS coronavirus genome sequences 
and is highly homologous with other coronavirus 3C-like 
proteinase. 

Two crystal structures of coronavirus 3C-like proteinase 
from transmissible gastroenteritis virus (TGEV) (5) and hu- 
man coronavirus (hCoV) 229E have been solved (6). The struc- 
ture of coronavirus 3C-like proteinase contains three domains. 
The first two domains form a chymotrypsin fold, which is 
responsible for the catalytic reaction, and the third domain is 
a-helical with unclear biological function. Coronavirus 3C-like 
proteinase shares the chymotrypsin fold part with the 3C pro- 
teinases from other viruses like rhinovirus called picornavirus 
(7, 8). The 38C proteinase of rhinovirus has been used as a target 
to develop drugs against the common cold (9-15). Because of 
the functional importance of SARS 3C-like proteinase in the 
viral life cycle, it has been proposed to be a key target for 
structural-based drug design against SARS (6). Homology mod- 
eling for the SARS 3C-like proteinase has been performed by 
various groups (6, 16, 17), and the conformational flexibility of 
the substrate-binding site has been studied (17). Virtual 
screening of chemical compounds libraries has given possible 
inhibitors (16). An 8-mer peptide has been docked on the model 
of SARS 3C-like proteinase to study the possible interactions of 
the protein and the substrate (18) 

Similar to other coronaviruses, a sequence analysis revealed 
11 cleavage sites of the 3C-like proteinase on the SARS 
polyprotein. The substrate specificity of coronavirus 3C-like 
proteinase is determined mainly by the P1, P2, and P1’ posi- 
tions (19). The P1 position has a well conserved Gln residue, 
and the P2 position has a hydrophobic one. Unlike other pre- 
viously identified coronavirus 3C-like proteinases, which have 
Leu/Ile at position P2, SARS 3C-like proteinase also tolerates 
Phe, Val, and Met residues at P2 position. To study the sub- 
strate specificity of SARS 3C-like proteinase, we have cloned, 
expressed, and purified the protein and studied its activity 
toward 11 peptides covering the 11 cleavage sites on the virus 
polyprotein. Our results confirm that purified SARS 3C-like 
proteinase is active toward substrate peptides mapped from 
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the cleavage sites on the polyprotein and reveals the substrate 
requirement of the proteinase-binding site. This study helps to 
understand the mechanism of SARS polyprotein process and 
provides clues for drug design. 


EXPERIMENTAL PROCEDURES 


Cloning of SARS-CoV 3C-like Proteinase—The reverse transcrip- 
tional mixture of SARS-CoV RNA from supernatant fluid of the virus- 
infected Vero cells using random primers was generously supplied by 
Dr. Y. Lu from Zhejiang Provincial Center for Disease Prevention and 
Control. For cloning of the cDNA of 3C-like proteinase of the virus, the 
first stand cDNA mixture was subjected to PCR amplification using a 
pair of specific primers, comprising F99 (5'-AGT GGT TTT AGG AAA 
ATG GCA TTC CC-3’) and R108 (5'-TTG GAA GGT AAC ACC AGA 
GC-3') to amplify a 917-bp fragment containing full-length 3C-like 
proteinase coding sequence. The PCR products were purified by agarose 
gel electrophoresis and then cloned directly into pGEM T Easy vector 
(Promega, Madison, WI). The resultant sequence confirmed that the 
amplified fragment was the same as that of SARS-CoV 3C-like 
proteinase. 

Construction of Plasmid pET 3CLP-21x—The gene of the SARS 
3C-like proteinase was amplified by PCR from the cloning vector 
(pGEM T Easy) described above using primers 3CLP-Nhe (5'-CACTG- 
CTAGCGGTTTTAGGAAAATGGCATTCCC-3’) and 3CLP-Xho (5’-CA- 
CTCTCGAGTTGGAAGGTAACACCAGAGC-3’). The PCR product was 
digested with NheI and XhoI and ligated with Ndel/Xhol-digested 
pET21a DNA. The resulting plasmid pET 3CLP-21x encodes a 35.1 kDa 
protein containing a C-terminal His,-Taq. 

Expression of 3C-like Proteinase—pET 3CLP-21x was transformed 
into Escherichia coli BL21(DE3) cells. Cultures were grown at 37 °C in 
1 liter of LB medium-containing ampicillin (100 ug/ml) until the Ago, 
reached 0.8 and then induced with 0.5 mM isopropyl-1-thio-B-D-galac- 
topyranoside at 30 °C for 3 h. The cells were harvested by centrifuga- 
tion at 5000 X g for 10 min. The pelleted cells were suspended in buffer 
A (40 mm Tris-HCl, pH 8.0, 100 mm NaCl, 10 mm imidazole, 7.5 mm 
2-mercaptoethanol), at 2% of the original culture volume. After cell lysis 
by ultrasonic, the cell lysate was separated by centrifugation at 
24,000 x g for 20 min. The filtrated supernatant was applied to a 
nickel-nitrilotriacetic acid column (Qiagen) equilibrated by 50 ml of 
buffer A. After being washed in 100 ml of buffer A, the 3C-like protein- 
ase was eluted with the gradient of 1-100% buffer B (40 mm Tris-HCl, 
pH 8.0, 100 mm NaCl, 250 mM imidazole, 7.5 mm 2-mercaptoethanol). 
The eluted enzyme was concentrated and loaded on a gel filtration 
column Sephacryl S-200 HR (Amersham Biosciences) equilibrated by 
180 ml of buffer C (40 mm Tris-HCl, pH 8.0, 100 mm NaCl, 7.5 mm 
2-mercaptoethanol). After elution with another 180 ml buffer C, we 
received over 95% purified 3C-like proteinase. 

Analytic Gel Filtration—The aggregation state of the SARS 3C-like 
proteinase was analyzed using a Superdex 75 HR column (Amersham 
Biosciences) on AKTA fast protein liquid chromatography. Freshly pu- 
rified protein was diluted to 4 and 0.2 mg/ml and equilibrated at room 
temperature for 2 h. 400 pl of 4- and 2-ml 0.2 mg/ml samples were 
injected into the Superdex 75 HR column and eluted with the buffer (40 
mM Tris-HCl, pH 8.0, 100 mm NaCl, 7.5 mm 2-mercaptoethanol) at a 
flow rate of 0.5 ml/min. The eluted peaks were monitored at 280 nm on 
fast protein liquid chromatography. 

CD Spectra—All of the CD spectra of the proteinase and the sub- 
strate peptides were recorded on a Jobin Yvon CD 6 spectrometer at 
20 °C. The CD spectra of 3C-like proteinase were recorded in 40 mm 
Tris-HCl buffer, pH 8.0. For near-UV CD spectrum, a cell with a path 
length of 1 mm was used and the proteinase concentration is 544 uM, 
whereas a cell with a path length of 0.1 mm and 54.4 uM of proteinase 
solution was used for far-UV CD spectrum. The substrate peptides were 
solved in 20 mo Tris-HCl buffer, pH 7.3, and the final concentration 
was 2 mM. A cell with a path length of 0.1 mm was used. Each spectrum 
was the average of four scans corrected by subtracting a spectrum of the 
buffer solution in the absence of proteinase/peptide recorded under 
identical condition. Each scan in the range of 184—260 nm for far-UV 
CD and of 250-320 nm for near-UV CD spectra was obtained by taking 
data points every 0.5 nm with integration time of 1 s and a 2-nm 
bandwidth. Thermal denaturation spectrum was recorded by CD at 218 
nm using the same condition for far-UV CD spectrum from 10 to 90 °C 
with an interval of 0.5 °C. Secondary structure content was calculated 
using the program VARSLC1 (20) 

Synthesis of Substrate Peptides—The substrate peptide S01 was 
synthesized by solid-phase peptide synthesis using standard Fmoc (N- 
(9-fluorenyl)methoxycarbonyl)/tert-butyl strategy (21). The cleavage of 
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the peptide from Rink resin and removal of all of the side-chain pro- 
tecting groups were achieved in trifluoroacetic acid solution. The crude 
peptide was purified by reversed-phase high performance liquid chro- 
matography (RP-HPLC, LabPrep System, Gilson) on a Vydac C18 semi- 
preparative column (218TP510, 10 by 250 mm, Vydac) with gradients of 
water/acetonitrile containing 0.1% trifluoroacetic acid. Peptide homo- 
geneity and identity were analyzed by analytical HPLC and matrix- 
assisted laser desorption/ionization time-of-flight mass spectroscopy 
(MALDI-TOF MS), respectively. Other substrate peptides of HPLC 
purity from S02 to S11 were purchased from GL Biochemistry Ltd. 
(Shanghai, China). 

Peptide Cleavage—The proteolysis activity of the SARS 3C-like pro- 
teinase was determined by peptide cleavage assay. Peptide S01 (Table 
T) was used as substrate and was incubated with the enzyme in Tris- 
HCl buffer, pH 7.3, at room temperature. The cleavage mixture was 
analyzed by RP-HPLC. To verify the cleavage site on the substrate 
peptide, the two products were purified by semi-preparative RP-HPLC 
using a 15-min 0-50% linear gradient of acetonitrile in 0.1% trifluoro- 
acetic acid and lyophilized. The relative molecular weights of the prod- 
ucts were identified by MALDI-TOF MS (BIFLEX III time-of-flight 
mass spectrometer, Bruker). 

The relative enzyme activity at different pH values was determined 
in citric acid/phosphate buffer (pH 5, 6, 7, and 8) or glycine/NaOH buffer 
(pH 9 or 10) containing 6.8 mm dithiothreitol, 2 mm S01 as substrate, 
and 2.14 uM SARS 3C-like proteinase with a final volume of 50 pl. The 
cleavage reaction was stopped after 20 min by the addition of 50 yl of 
0.1% trifluoroacetic acid aqueous solution and analyzed by RP-HPLC 
(LabPrep System, Gilson) on a Zorbax C18 analytic column (4.6 x 250 
mm, Agilent). Cleavage products were resolved using a 15-min 0-50% 
linear gradient of acetonitrile in 0.1% trifluoroacetic acid. 

To determine the k,,,/K,, for the substrate, 0.2 mm of substrate 
peptide was incubated with SARS 3C-like proteinase in 40 mm Tris-HCl 
buffer, pH 7.3. The concentration of the enzyme varied from 0.90 to 22.5 
uM because of the different cleavage activity to different substrates. 
Reaction aliquots were removed at different times within 7 h and 
analyzed by RP-HPLC as described above. k,,,/K,, was determined by 
plotting substrate peak area as Equation 1, 


InPA = C — Regat/Ky (Eq. 1) 


where PA is the peak area of the substrate peptide, c, is the total 
concentration of 3C-like proteinase, and C is an experimental constant. 
K,,, and k,,, of the proteinase for selected substrates were determined 
by incubation of the substrate peptide at different concentration vary- 
ing from 2 to 0.1 mm with SARS 3C-like proteinase in 40 mm Tris-HCl 
buffer, pH 7.3, for 20 min and analyzed by RP-HPLC as described 
above. The concentration of the enzyme varied from 1.07 to 17.1 um 
because of the different cleavage activity to different substrate. Peak 
areas were calculated by integration and converted to absolute units by 
using peptide standards. The reaction rate was calculated for all of the 
cleavage products of two experiments and averaged. K,,, and k,,, were 
calculated by the Lineweaver-Burk plot. 


cat 


RESULTS AND DISCUSSION 


Biosynthesis, Purification, and Secondary Structures of Re- 
combinant SARS CoV 3CL Proteinase—The C-terminal His 
tagged SARS 3C-like proteinase has been successfully ex- 
pressed in EF. coli and purified. Induction was first done at 
37 °C. As a result, the majority of 3C-like proteinase can be 
found in the insoluble fraction of the cell lysate. Induction with 
isopropy]-1-thio-8-D-galactopyranoside was then done at 30 °C 
for 3 h. As a result, most of the 3C-like proteinase was found in 
the soluble fraction. The protein was purified by nickel column 
followed by gel filtration on a Sephacryl S-200 HR column (Fig. 
1). Approximately 10 mg of purified protein can be obtained 
from 1-liter cells. The protein can be concentrated to 10 mg/ml 
in 50 mn Tris-HCl, 0.1 M NaCl, and 1 mn dithiothreitol, pH 7.3. 

The CD spectra of SARS 3C-like proteinase were shown in 
Fig. 2. Far-UV and near-UV CD spectra show that the purified 
protein has well defined secondary and tertiary structures. 
Far-UV CD spectrum (Fig. 2a) shows a positive peak at 196 nm 
and two negative peaks at 209 and 222 nm, respectively, which 
clearly indicates for a mixed a and 8 structure. Calculated 
secondary structure content shows 26% of a -helix and 23% of 
B -sheet, similar to those in the TGEV 3C-like proteinase crys- 
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Fic. 1. Expression and purification of the SARS 3C-like pro- 
teinase. Samples taken at each step of purification were analyzed on a 
12% SDS-polyacrylamide gel, and the protein was stained with Coo- 
massie Brilliant Blue. Lane M, protein molecular mass marker; lanes 1 
and 2, cell lysates from non-induced and isopropyl]-1-thio-B-D-galacto- 
pyranoside-induced E. coli BL21(DE3) containing the expression plas- 
mid pET 3CLP-21x; lanes 3 and 4, the supernatant and the deposition 
separated from the cell lysate by centrifugation; lane 5, collected peak 
fractions from the nickel-nitrilotriacetic acid column; lane 6, collected 
peak fractions from the Sephacryl S-200 HR column. 


tal structure (22 and 26%, respectively). Near-UV CD spectrum 
(Fig. 26) shows a broad positive peak at ~280 nm and a small 
positive shoulder at 291 nm, indicating a well folded tertiary 
structure. Thermal melting of the protein by monitoring the 
CD signal at 218 nm (Fig. 2c) gives a sigmoid denaturation 
curve and a T,, of 61°C, indicating a highly cooperative 
thermodenaturation. 

Aggregation State and Enzyme Activity—Because both the 
crystal structures of the 3C-like proteinase in TGEV and hu- 
man coronavirus give dimer structure and the residues at 
dimeric interface are conserved in coronavirus, it has been 
proposed that the dimer may be the biological functional form 
of the protein (5, 6). Dynamic light-scattering experiment 
shows that both hCoV 229E and TGEV 3C-like proteinases 
exist as a mixture of monomer (65%) and dimer (35%) at a 
concentration of 1-2 mg/ml (6). We have performed analytical 
gel filtration of SARS 3C-like proteinase at different concen- 
trations using a Superdex 75 column (Fig. 3). At the concen- 
tration of 4 mg/ml, two peaks corresponding to the monomeric 
and dimeric form of the protein appeared while only one peak 
corresponding to the monomeric form was found at a lower 
concentration of 0.2 mg/ml. The dissociation constant of the 
dimer was estimated to be around 100 um. Jn vitro peptide or 
protein cleavage assays of coronavirus 3C-like proteinase usu- 
ally were performed with a protein concentration at micromo- 
lar level (22, 23). In this study, we also used a peptide cleavage 
assay with the SARS 3C-like proteinase concentration not 
>22.5 um that corresponds mainly to the monomer form of the 
enzyme. This raises an interesting question whether the minor 
amount of the dimer plays a major role in catalysis, although 
we are still not clear about the exact form of the protein in the 
cell under a molecular crowding environment. 

To answer this question, the proteolysis activity of SARS 
CoV 3C-like proteinase at different enzyme concentrations was 
studied. Peptide S01 was used as substrate and was incubated 
with the proteinase in Tris-HCl buffer, pH 7.3, at room tem- 
perature. The cleavage mixture was analyzed by RP-HPLC. 
The peak area of the substrate SO1 decreases as reaction time 
increases, whereas the area of two newly formed peaks in- 
creases. The products were collected and identified by MALDI- 
TOF MS with experimental (M + H)* of 593 and 618, respec- 
tively, that were identical to the theoretical (M + H)* of the 
C-terminal pentapeptide and N-terminal hexapeptide. This 
confirms that the substrate is cleaved by the SARS CoV 3C-like 
proteinase at the predicted fragile Gln-Ser peptide bond. 
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Fic. 2. CD spectra of SARS 3C-like proteinase. The CD spectra of 
3C-like proteinase were recorded in 40 mm Tris-HCl buffer, pH 8.0 at 
20 °C. a, far-UV CD spectrum shows a positive peak at 196 nm and two 
negative peaks at 209 and 222 nm, respectively, indicating a mixed a 
and f structure. b, near-UV CD spectrum shows a broad positive peak 
at ~280 nm and a small positive shoulder at 291 nm. c, thermal melting 
of the protein by monitoring the CD signal at 218 nm gives a sigmoid 
denaturation curve and a T,,, of 61°C 


The observed k,,,/K,, was determined as described under 
“Experimental Procedures” at different proteinase concentra- 
tions between 4.5 and 0.9 uM (shown in Fig. 4). The observed 
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Fic. 3. Aggregation state of SARS 3C-like proteinase on ana- 
lytic gel filtration column. Superdex 75 HR column was used on 
AKTA fast protein liquid chromatography for the analysis. Solid line, 4 
mg/ml; dot-and-dash line, 0.2 mg/ml. Two peaks corresponding to mo- 
nomeric and dimeric states were detected at 4 mg/ml, whereas at 0.2 
mg/ml, only one peak corresponding to the monomeric state appears. 
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Fic. 4. Enzyme activity of SARS CoV 3C-like proteinase at 
different concentrations. The proteolysis activity of expressed SARS 
CoV 3C-like proteinase at different concentrations was determined by 
incubating 0.2 mM S01 with 3C-like proteinase in 40 mm Tris-HCl 
buffer, pH 7.3. The observed .,,,/K,, increases in a linear manner, 


cat! m 


whereas the enzyme concentration increases, indicating the dimer form 
is the active form of the proteinase. 


Reat/K,, increases linearly with the increase of the enzyme 
concentration. Using the estimated dissociation constant of 
~100 uM, we can deduce from the linear dependence that the 
monomeric form of SARS CoV 3C-like proteinase almost has no 
catalytic activity, whereas the k,,,/K,,, of the dimeric form is 
~1.4 X 10° mm! min™?!. This result supports the previous 
prediction that the dimer of the 3C-like proteinase is the active 
form of the enzyme and also explains that the low in vitro 
activity of CoV 3C-like proteinases compared with other mono- 
meric virus 3C proteinases is because of the low concentration 
of the active dimeric form under the assay conditions. 
Substrate Specificity—Eleven 11-mer peptides derived from 
the 11 cleavage sites of 3C-like proteinase upon the SARS 
polyprotein have been synthesized and tested for SARS 3C-like 
proteinase cleavage experiment (Table I). The proteolysis ac- 
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tivity of expressed SARS CoV 3C-like proteinase at different 
pH was tested as described under “Experimental Procedures.” 
When peptide S01 was used as substrate as shown in Fig. 5, the 
proteinase had the highest activity at around pH 7 and de- 
creased at another pH, resulting a bell-shaped curve. When pH 
decreased to 5, the proteolytic activity of the enzyme decreased 
to 10% because of the protonation of the catalytic His-41. 

To investigate the relationship between the proteolysis ac- 
tivity and the secondary structure of the substrates, far-UV CD 
spectra were recorded for all of the peptides at 2 mm (for S09 
the peptide concentration was 0.2 mm because of its low solu- 
bility) (see Fig. 6a). The secondary structure contents were 
calculated using VARSLC1 (20) with the CD data in the range 
of 184-260 nm for each peptide with the exception of S09 
(which has very low solubility) and summarized in Table II. 
Although not obvious in the CD spectra, each peptide forms 
more or less sheet structures. The calculated sheet content 
reflects the formation of a B-strandlike extended conformation 
in equilibrium with other turnlike or random conformations. 
Fig. 6b shows an interesting tendency in which substrate pep- 
tide with small k,,,/K,,, has relatively less content of sheet but 
more of other unordered structure, suggesting that the forma- 
tion of B-strandlike extended structure is favored for binding to 
the enzyme and proteolysis. As shown in the crystal structure 
of TGEV 3C-like proteinase, residues P5 to P3 of the substrate 
form an antiparallel B-sheet with segment 164-167 of the long 
strand ell of the enzyme on one side and with segment 186-191 
on the other side (6). The binding modes of substrate to differ- 
ent coronavirus 3C-like proteinases were predicted to be iden- 
tical by comparing the crystal structure of the substrate-bind- 
ing regions of the free proteinases of hCoV and SARS CoV and 
of TGEV proteinase in complex with a hexapeptidyl chlorom- 
ethyl ketone inhibitor (6). Substrate peptide that tends to form 
an extended #-strandlike conformation would be the preferred 
choice to bind with the enzyme, resulting in a small K,, and a 
large k,,,/K,,,. It is also supported by a previous study (25) on 
3C proteinase in which helix-destabilizing residues were often 
found in close proximity to cleavage sites. 

Of all of the 11 peptides tested, SO1 and S02 were the most 
suitable substrates for SARS CoV 3C-like proteinase cleavage, 
which were derived from the N-terminal and C-terminal self- 
cleavage sites, respectively. K,, of S01 was determined as 
1.15 + 0.28 mm, which is three times larger than that of hCoV 
229E 3C-like proteinase for a 15-mer substrate, 0.39 + 0.07 
mm. The relative large K,, was counteracted by a large k..;, 
12.2 + 2.9 min ~!, which is an order larger than that of the five 
other substrates whose £,,, and K,,, values are also determined 
by the Lineweaver-Burk plot (Table I). 

Most of the cleavage sites of coronavirus polyprotein have a 
conserved (Leu/Ile)-Gln | (Ser, Ala, or Gly) core sequence (the 
cleavage site is indicated by | ). However, the SARS coronavi- 
rus polyprotein has three noncanonical cleavage sites with Phe, 
Val, or Met in the P2 position and one noncanonical cleavage 
site with Asn in the P1' position. Here, peptides S02, S03, and 
S07 are derived from the three noncanonical cleavage sites at 
P2 and peptide S05 is derived from the one noncanonical cleav- 
age site at Pl’. With the dipeptide sequence of Asn-Asn at 
positions P1’ and P2’, peptide S05 has relatively low catalytic 
efficiency, which is the second lowest one among all of the 11 
peptides. Peptides S02 and S03 have a Phe and a Val, respec- 
tively, in the position P2 and have similar cleavage activity as 
other substrates with Leu in P2 position, indicating that Phe or 
Val can also be well fitted in the large hydrophobic S2 pocket so 
that the substitutions are tolerated for 3C-like proteinase 
cleavage. The substitution by a relative large hydrophobic 
amino acid Phe did not affect the substrate binding, whereas 
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TABLE I 
Catalytic parameters and relative cleavage efficiencies of eleven peptides that represent all of the eleven cleavage sites 
in SARS coronavirus polyprotein 


Peptide Cleavage site Sequence me Reat Real Km (Rea/K,,)rel 
mM min? mau? min? 
S01 P1/P2 TSAVLQ/SGFRK-NH, 1.15 + 0.28 12.2 + 2.9 10.6 + 0.67 1.00 
S02 P2/P3 SGVTFQ/GKFKK 0.583 + 0.086 2.55 + 0.35 4.38 + 0.26% 0.41 
S03 P3/P4 KVATVQ/SKMSD 0.353 + 0.013° 0.03 
S04 P4/P5 NRATLOQ/AIASE 0.556 + 0.126° 0.05 
S05 P5/P6 SAVKLOQ/NNELS 0.202 + 0.003° 0.02 
S06 P6/P7 ATVRLOQ/AGNAT 1.44 + 0.47 3.29 + 1.07 2.29 + 0.09% 0.22 
S07 P7/P8 REPLMQ/SADAS 0.0176 + 0.0022° 0.002 
S08 P8/P9 PHTVLQ/AVGAC 1.94 + 0.66 1.68 + 0.57 0.865 + 0.028° 0.08 
so9 P9/P10 NVATLQ/AENVT 0.976 + 0.014° 0.09 
S10 P10/P11 TFTRLOQ/SLENV 0.286 + 0.020 0.847 + 0.051 2.96 + 0.11° 0.28 
S11 P11/P12 FYPKLO/ASQAW 0.549 + 0.105 1.57 + 0.28 2.86 + 0.21% 0.27 


“ Determined by Lineweaver-Burk plot. 


’ Determined by fitting reaction kinetic data as in Equation 1 at low substrate concentration. 
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Fic. 5. Enzyme activity of SARS CoV 3C-like proteinase at 
different pH values. The proteolysis activity of expressed SARS CoV 
3C-like proteinase at different pH values was determined in citric 
acid/phosphate buffer (pH 5, 6, 7, and 8) or glycine/NaOH buffer (pH 9 
or 10) containing 6.8 mM dithiothreitol, 2mm S01 as substrate, and 2.14 
pM 38C-like proteinase. 


substitution by a relative small hydrophobic amino acid Val 
decreased the cleavage activity, indicating an important role of 
pocket S2 in anti-SARS drug design. 

On the other hand, peptide S07, which has a methionine in 
the P2 position, is the substrate with least cleavage activity 
among 11 substrates. The low cleavage activity of SO7 may be 
caused by the mutant from Leu to Met, although it may also be 
affected by other reasons such as different secondary struc- 
tures. As shown in the far-UV CD spectrum, peptide S07 may 
adopt an unusual helix-like, type II polyproline conformation, 
which has a strong negative peak at ~200 nm and a positive 
peak at ~230 nm. The formation of the polyproline helix will 
disturb the conformation transition to B-strandlike-extended 
structure of the substrate peptide and disable the binding of 
the substrate to the enzyme. It is also supported by the factor 
that this peptide has the least content of sheet. 

Hegyi and Ziebuhr (19) have studied the conservation of 
substrate specificities among coronavirus 3C-like proteinase 
from three coronavirus by using peptides corresponding to the 
four cleavage sites on the viral polyprotein. Our findings that 
the two peptides S01 (P1/P2) and S02 (P2/P3) corresponding to 
the two self-cleavage sites of the SARS 3C-like proteinase are 
the two most reactive ones and S05 (P5/P6) is less reactive are 
comparable to their results. This implies that SARS 3C-like 
proteinase follows the same substrate specificity rules govern- 
ing all of the coronavirus 3C-like proteinase. As we have stud- 
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Fic. 6. The relationship of secondary structure contents in 
substrate peptides with cleavage efficiency. a, far-UV CD spectra 
of peptide substrates. @, S01; @, S02; A, S05; V, S07; and @, S11. 6, 
calculated content of secondary structure versus k,,,/K,,, profile. This 
figure shows an interesting tendency of which substrate peptide with 
small k,,,/K,, has relatively less content of sheet (Ml) but more content of 
unordered structure (O), suggesting that the formation of B-strandlike 
extended structure is favored for binding to the enzyme and proteolysis. 


ied, the substrate specificities of SARS 3C-like proteinase using 
peptides covering all of the 11 cleavage sites try to correlate not 
only the conserved sequences but also secondary structures of 
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TABLE IT 
Calculated secondary structure contents for the 11-peptide substrates 
Helix Sheet Tum Other 

sol 5 35 27 32 
S02 3 42 24 31 
S03 2 25 28 46 
S04 2 24 30 45 
S05 10 19 27 44 
S06 2 25 30 43 
S07 dt 18 40 41 
S08 5 21 29 45 
$10 4 22 33 41 
S11 2 30 32 36 


the substrate peptides. The full spectrum of substrate specific- 
ity for SARS 3C-like proteinase studied here can be extended to 
understand other coronavirus 3C-like proteinases. 

In summary, the recombinant SARS 3C-like proteinase has 
been successfully cloned and expressed. The purified protein 
exists in a mixture of monomer and dimer at a concentration of 
4 mg/ml but mostly monomer at 0.2 mg/ml. The specific activity 
of the enzyme decreases linearly with the decrease of enzyme 
concentration implying that only the dimeric form is active and 
that the dimeric interface could be targeted for structural 
based drug design against SARS 3C-like proteinase. The en- 
zyme can cut the 11 peptides covering all of the 11 cleavage 
sites on the viral polyprotein with different efficiency. Sub- 
strates with a more f£-sheetlike structure tend to react fast. The 
P2 position of the substrates seems to favor large hydrophobic 
residues. This study provides basic understandings of the en- 
zyme catalysis and substrate specificity for SARS 3C-like pro- 
teinase and helpful information for structurally based inhibitor 
design. 
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